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ABSTRACT 


PARK, STEPHEN KENT. On the equivalence of optimal control problems and 
the transformation of optimal control problems with compact control regions 
into Lagrange problems. (Under the direction of HANS SAGAN.) 

Several natural types of equivalence are defined for a set of optimi- 
zation problems. Using these concepts of equivalence, a method is 
described by which an optimal control problem with a compact control region 
U can be transformed into an equivalent optimal control problem with an 
(in general) arbitrary control region Z. This method assumes the existence 
of a function t which maps Z onto U and satisfies certain continuity 
requirements. It is shown that when U is convex the existence of this 
function is guaranteed and that when U is, in addition, a polyhedron 
various representations for are given. 

The particular case where Z is all of euclidean p- space, for some 
positive integer p, is considered in some detail. It is shown that when 
U is a convex body the optimal control problem is equivalent to (jL-e., may 
be transformed into) an optimal control problem in which the control functions 
are unconstrained. Furthermore, when U is a polyhedron it is shown that 
the optimal control problem is equivalent to a classical Lagrange problem. 

This later result provides, as an application, a proof of the Pontryagin 
maximum principle based upon known necessary conditions in the calculus of 


variations . 
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1 . INTRODUCTION 


On page 2l8 of (22), L. C. Young expresses the following sentiment. 

"We see that there is no real difference between the Lagrange problem and 
optimal control) the latter is simply a more up-to-date formulation. 

Apparent minor differences are sometimes pointed out, but they are really 
quite insignificant." It is interesting to contrast this statement with the 
one made by L. S. Pontryagin et al. on page 239 of (l8) which reads 
" . . . the optimal control problem is a generalization of the problem of 
Lagrange in the calculus of variations, and is equivalent to the latter in 
the case where the control region U is an open set ..." Regardless 
of which of these two philosophies one chooses to adopt, when faced with an 
optimal control problem with a (typically) closed and bounded control region 
it is not at all clear, in most cases, what results from the calculus of 
variations (if any) are applicable to the problem. As early as 19^-8, 

M. R. Hestenes (8) used the calculus of variations to analyze an optimal 
control system in which the control region was defined by a system of 
inequalities. He used the method of "slack variables" - usually attributed 
to F. A. Valentine (21) - and transformed the optimal control problem into a 
Bolza problem. Since that time, this approach has been investigated by a 
variety of authors and a large portion of the research is contained in 
Hestenes' book (9). 

Herein we adopt a procedure for transforming optimal control problems 
into calculus of variation problems which is similar in philosophy to 
Hestenes', but which makes no use of slack variables. The procedure is 
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highly geometric in flavor and places an emphasis on the construction of 
mappings from one control region onto another. Using these mappings , we 
define a rather natural type of equivalence in the set of all optimal 
control problems (of a certain type) . As an immediate by-product we find 
that it is possible to identify a large class of optimal control problems 
which are, to within a transformation, Lagrange problems in the calculus 
of variations. 

The procedure mentioned in the previous paragraph may be illustrated 
as follows . Consider the problem of optimally steering a control system 
with the control function u subject to the constraint u(t)e U. Intui- 
tively, one would believe that, at least if i|r is continuous, one-to-one, 
onto and has a continuous inverse, this problem is equivalent to the 
problem of optimally steering the same system with a control function z 
subject to the constraint z(t)e Z where u(t) = \|/(z(t)) and maps Z onto 
U. This idea is very briefly mentioned on page 2l8 of (22) and Young 
picturesquely concludes that "this corresponds to providing a new set of 
dials z, which are used to control the original dials u, as in regulating 
a television set without getting up from one's arm chair." 

To be more specific, the basic problem with which we will be concerned 
is the following one. Suppose an optimal control system is given in which 
the control (vector) function is constrained to lie in a compact control 
region contained in euclidean m- space R m . We seek to determine conditions 
sufficient to guarantee that there exists a positive integer p, a subset Z 
of and a function i|r from Z onto U which together transform the 

original control system into a new control system, with control region Z, 
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which is (in some sense) equivalent to the original. Having done this we 
seek additional conditions which will permit us to choose Z = and 
thereby transform the original optimal control problem into an equivalent 
Lagrange problem. 

In chapter 2 we consider a very broad class of optimization problems 
and discuss two concepts of equivalence in this class. In antici- 
pation of investigating the questions mentioned in the previous paragraph, 
we also define what is called, for lack of a better name, ^-equivalence. 

It turns out that ^-equivalence is stronger than one of the previously 
mentioned concepts of equivalence; namely, if an optimization problem Pg 
is ^-equivalent to an optimization problem P^ then Pg and P^ are 
equivalent. In the context of the previous paragraph ^-equivalence is a 
very natural concept. For it turns out that if the original optimal 
control problem, say P^, is transformed into a second problem Pg and if 
Pg is 'I'- equivalent to P^ then the following is true: if u:[t Q , t^] ->U 
is a (bounded, measurable) optimal control function for P^ then there 
exists at least one (bounded, measurable) optimal control function 
z:[tQ, t^] -> Z for Pg such that u = ^°z (where o denotes function 
composition); conversely if z:[tg, t^] -» Z is an optimal control for Pg 
then u = foz is an optimal control for P . Therefore, we ask the 
question, under what conditions is the transformed problem Pg ^-equivalent 
to the original problem P^ . Theorem 2.2 and corollary 2.1 give a complete 
answer to this problem i...e., necessary and sufficient conditions for Pg 
to be \t- equivalent to P^. From these results we obtain corollary 2.2 which 
serves as the basis of the investigation in chapter 3. 
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The principle result of chapter 3 is theorem 3-4 which relies heavily 
on Filippov' s implicit function lemma (5) and corollary 2.2. The theorem 
says that if there exists a positive integer p, a set ZCRP, and 
continuous function i|f;Z -> U which satisfies \|r(Z) = t(Z*) = U where 
Z*CZ is compact, then P 2 is ^-equivalent to P . It is a tribute to 
the strength of Filippov' s lemma that theorem 3.4 remains true without any 
reference to the structure of the sets t _1 (u) where u e U (except, 
of course, for the structure of t _1 (u) induced by the continuity of ty) . 
Specifically t is not by necessity one-to-one which illustrates that, in 
the space of control functions, the normal (ji.e^., is a homeomorphism) 
concept of a change in variables is definitely overly restrictive. 

Chapter 4 is an application of theorem 3.4 with Z = R P and 
continuously differentiable. Specifically, we show that in many cases 
there exists a Lagrange problem which is ^-equivalent to the original 
optimal control problem. In chapter 5 we use the results of chapter 4 to 
explore further the relationships between the theory of optimal control 
and the calculus of variations. The principle result here is that, using 
the multiplier rule, Weierstrass condition, transversality conditions and 
the concept of ^-equivalence, we are able to obtain, for certain compact control 
regions U, a proof of the Pontryagin maximum principle. Since this proof 
serves as an illustration of the material contained in chapters 2, 3j 4, and 
6, it is by necessity not ideally organized. Certainly if our sole objective 
had been a proof of the maximum principle much of the material in each chapter 
could have been eliminated and the remaining material reorganized. 
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Since the material of chapters 2 through 5 rests on the assumption that 
a set Z and an onto function t: Z -» U exists we turn to this question in 
chapter 6. We show that when U is a convex body in R m then the hypotheses 
of theorem 3.4 are satisfied. In the less general case where U is a 
polyhedron we explicitly construct such a t which is, in addition, continu- 
ously differentiable with Z = R P . Finally in the simple case where U is 
a parallelipiped we show that \|r assumes a very simple form. 

In the appendix we consider the question of local, as opposed to global, 
^-equivalence. The discussion parallels chapter 2 and we obtain the local 
analogues of theorem 2.2 and corollary 2.1. We close with a brief discussion 
of an application of the transformation approach outlined herein to the 
problem of minimizing a real valued function of n variables. 

Since the idea of a change of variables in the space of control functions 
is such an intuitive one, it is reasonable to assume that it has been 
considered by others. This is, in fact, true for, as pointed out in chapter 6, 
it is common practice (for example) to begin the statement of an optimal 
control problem with a phrase such as "let the control region U be a right 
parallelipiped and assume without loss of generality that U = [-1, 1] 

This statement is based on the (implicit) fact that there is a (linear) 
homeomorphi sm from [-1, l] m onto the right parallelipiped and the (implicit) 
assumption that a change of variables induced by a homeomorphism preserves 
equivalence. The idea of transforming an optimal control problem into a 
Lagrange problem (in the manner outlined herein) is far less frequently 
encountered. However, the idea is mentioned, for example, in (2) and (l4) 
and on pages 134-138 of (4). Typically these references use U = [-1, 1] 
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with Z = R and \|f(z) = sin z (or the immediate generalization of this idea 
to the case U = [-1, l] m ) coupled with the (implicit) assumption that the 
transformed problem is equivalent to the original. 
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2. THE EQUIVALENCE OF OPTIMIZATION PROBLEMS 

Let A be a suitable index set and suppose that for each a £ A 

there corresponds a set Y , a nonempty subset ICY, and a functional 

-> R where R denotes the reals. Despite the fact that may 

have no algebraic or topological structure one can still consider the 

following global problem : to find an o° e ft such that F A (o°) ^ F (a) 

a A A 

for all a e ft^. Henceforth, for A e A, we will refer to this problem as 
refer to the elements of ft^ as admissible elements and refer to the 
element a e ft^ (if it exists) as a global optimal ( admissible ) element . 
Furthermore, if a global optimal element (elements) exists for P^ , we will 
frequently refer to it (them) as a solution ( solutions ) although in the 
case where no global optimal elements exist the term solution denotes, instead 
a proof of this fact . Finally , since we will always be concerned with global 
(as opposed to local) properties we adopt the convention that in the absence 
of a modifier such terms as optimal element, optimization problem and 
equivalence should be understood in the global sense.' 1 ' 

The formulation of the preceding problem is sufficiently general to 
guarantee that a wide variety of what could be loosely referred to as optimi- 
zation or minimization problems are included in the set F = j P ; A e a } . 

\ 

Specifically P contains the rather general optimal control problem to be 
formulated in chapter 3 and the Lagrange problem to be formulated in chapter k 
When discussing optimal control and/or Lagrange problems the notion of 


In the appendix the material of this chapter is interpreted in terms 
of a local problem. 



equivalence of problems P^ and P^ for A, p e a is frequently 

mentioned - in several different contexts. Unfortunately, these notions 

of equivalence are often only vaguely (if at all) defined. Furthermore , 

equivalence may not mean equivalence in the strict mathematical sense, 

i_._e. , in the sense that P and P are equivalent, written 

P^ = P , if 
1 p' 


is an equivalence relation on T. With this in mind 


let us first define and then discuss the following equivalence concepts. 

Definition 2.1 : For A £ A, let °Cft denote the set of optimal 

A A 

elements for problem P . 

A 

Definition 2.2 : For A, p e A if there exists a function -» 2 

P A 

such that F = F ° 'i r then 
P 1 

(1) P. is strictly equivalent to P, if is one-to-one and 

M* A 

(2) P^ ^-equivalent to if ^ 

(3) P is weakly ^-equivalent to P if °) = 0 , °. 

In other words P is strictly equivalent to P if is a one-to-one 
P A 

correspondence between the admissible elements of P and P , P is 

P A P 

'^equivalent to P^ if \f r maps the admissible elements for P^ onto the admissible 

elements for P^, P^ is weakly ^-equivalent to P^ if 'S f maps the 

optimal elements for P onto the optimal elements for P , and in each 

P A 

of these cases, corresponding admissible elements have the same functional 
value. As an immediate consequence of this definition we have the following. 
Remark 2.1 : Strict equivalence is an equivalence relation in F. 


Thus we have that the ordering of P, and P in the statement " 

A p 


P is 
P 
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strictly equivalent to P^ " is inconsequential. In contrast to this 

situation we see that 'P- equivalence and weak ^P -equivalence do not 

define equivalence relations on P. However, it is true that for A e /\, 

is both ^-equivalent and weakly 'P-equivalent to itself (SP is the identity 

on ft^) . Also the following is true. 

Remark 2.2: For A, p, v e A if P is 'P -equivalent (weakly) to 

M- 

and Py is <P -equivalent (weakly) to P then P is "'P °3>-equivalent 

(weak) to P^ where o denotes function composition. 

In other words the concepts of *P - equivalent and weak ’'P- equivalence 

define relations in r which are reflexive and transitive but are not, in 

general, symmetric. It is for this reason that throughout we will make a 

careful distinction between the statement "P^ and are strictly 

equivalent" and the weaker statements "P^ is \P -equivalent (weakly ^P- equivalent) 

P, " - in addition we will be careful to observe the order of terms P and 
A p 

P in the later statements. 

A 

It is clear that strict equivalence is a stronger concept than 


^-equivalence. The following theorem says that conversely if P is 
'P- equivalent to P^ then there exists a "subproblem" P^ contained in 

which is strictly equivalent to P . 

A 


P 


P 


Theorem 2.1 : If P is 'P -equivalent to P then there exists a 


nonempty subset ft C ft such that ft and F = 
v jj. v v 

restriction of F to ft ) define a problem p 
p v' v 

equivalent to P . 

A 


F I A ( i .e . , the 
P|^v 

which is strictly 
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Proof: Since \Ehft -» ft, is onto, for each a e ft, , the set 

“ — — M- A ' A 

is nonempty. Hence (by the axiom-of- choice) there exists a 

function $:ft uj\I> ^(a):a eft} such that $(a)e ^(a) for each 

a e ft^ i^e., \fr(0(a)) = a. Let ft^ = {^(a^a e ft^ } then by construction 

ft v C ft^ and ft^ / 0. Furthermore, the function <l> is one-to-one since 

$(a) = $(a*) implies a = ^($( 0 ,)) = ^($(a*)) = a*. Let F = F and 

V ^ ft v 

thereby define P . If a e ft, then <J>(a)e ft C ft and hence 
v A ' v p 

F (0(a)) = F (0(a)) = F (^o(a)) = F.(a) 
v M- A A 

i.e., F ,,°0 = F . Since O (ft ) = ft , the theorem is proved, 
v A A v 

Let us turn to the concept of weak O- equivalence. Following some 
preliminary lemmas we obtain (theorem 2.2) an alternate characterization of 
weak "0- equivalence and as a corollary we obtain the expected result that 
^-equivalence is indeed stronger than weak ^-equivalence. Hereafter let 
us avoid reference to the general index set A and instead simply refer 
to the optimization problems as P^, Pg and so forth. Furthermore, in the 
hypotheses of the following three lemmas let Pg be defined by ftg and F 2 = 
F^ ° jfr where 'Ihftg is not necessarily onto). 

Lemma 2.1 : If a e ft^, p e ft^ and ^(p) = a then a e ft° implies 

P e fi°. 

Proof : Consider p* e ft^ and a* = \]>(p*)e ft^. By hypothesis 

F-^(a*) > F^(a) and thus 

Fg(p^) = F 1 Wp*)) = F ] _(a*) > F 1 (a) = F^p)) = Fg(p) 
which establishes the lemma. 
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Lemma 2.2 : 'T' (2°) 3 2° iff^(2 2 )2fi°„ 

Proof: If 2° = 0 the lemma is trivially true. Therefore assume 

^ 0 0 and consider a° e 2°. If ^(SV,) 2 2° then there exists p° e 2g 
such that ^((3°) = a°. From lemma 2.1 we have p° e 2° and thus ^ (2°) 22 °. 
Conversely ^(2^) 22° implies 2 2° immediately since 

Lemma 2.3 : 2° 0 0 and ^(2°) c 2° iff 2° 0 0 and H 2° 0 0. 

Proof : If 2° 0 0 and ^(^ 2 ) ^ 2° 0 0 then there exists an a° e 2° 
and 3° e 2g such that ^(p°) = a°. From lemma 2.1 3° e 2° i.e., 2° 0 0. 
Furthermore, if 3 e 2° then F 2 (P) = F 2 (P°)- Lettin S a = ^(p) then 
Pj_( a ) = ^(p) = F 2 (p°) = F^(a°) . Since a° e 2° this implies a e 2° and 
thus ^(2°)C 2°. Conversely, if 2° 0 0 and 3^(2°) c 2° then there exists 
3° 6 2° and thus a° =^>(p°)€ 2°. Therefore, ^(2°) D 2° 0 0 which implies 

*(« 2 ) n 2° 4 0 . 

Combining lemmas 2.2 and 2.3 we obtain the following theorem which gives 
a necessary and sufficient condition for P 2 to be weakly ^-equivalent to P 1 

Theorem 2,2 : Suppose 2° 0 0. P 2 is weakly ^-equivalent to P^ 

iff \fr(2 2 )f>2°. If instead 2° = 0 then P^ is weakly ^-equivalent to P^ 
iff 2° = 0. 

Proof : If 2° = 0 the theorem follows as a restatement of definition 2.2 

If 2° 0 0 and P^ is weakly ^-equivalent to P^ then M>(2°) = 2° _i._e., 

3f0 2g) - 5 2° since 2 2 2 2°. Conversely, if 2° 0 0 and ^ (2 2 ) 2 2° then from 
lemma 2.2 (2°) 22°. Furthermore, 2° 0 0 implies \&(2 2 ) O 2° 0 0 and 

thus from lemma 2.3 3&(2°)C2° ji.e., P^ is weakly ^-equivalent to P^. 
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Corollary 2,1 : If Pg is ^-equivalent to P^ then Pg is weakly 


\E r - equivalent to P^ = 

Proof : If ^ 0 then theorem 2.2 applies since 2° C . if 

= 0 suppose there exists (3° e and let a° = \[>((3 0 ) , Consider 
a e then by hypothesis there exists p e such that \l>(p) = a . Since 
Fg(p) Fg(p°) this implies F^(a) ^ Fj L (a°) and since a was arbitrary it 
follows that a° e This contradiction establishes = (j and from 

theorem 2.2, Pg is weakly ^-equivalent to P^. 

As mentioned in the introduction our primary interest herein will be 
concerned (in general) with transforming a given optimal control problem 
(P^) into a second problem (Pg) which is in some sense equivalent to the 
former. Before considering this question in detail let us consider one ‘ 
final concept of equivalence. 

Definition 2.3 : Problems P^ and Pg will be said to be equivalent if 

there exist functions ^ 12 :fi l ^2 and ^21 : ^2 ^1 which satisfy 

F 2 “V*21 and F 1 ’ VV 

Notice that this does, in fact, define an equivalence relation on T. 
Furthermore, if and Pg are strictly equivalent there exists a 

one-to-one, onto function 'Jhftg -» satisfying Fg = F^ ° \E0 then using 


'F, 


-1 


-1 


~ ^ and ^2 = ^ we have F^ = Fg ° SP” and P^, Pg are equivalent. 
That is, strict equivalence is stronger than equivalence. As we observed 
previously, 'P- equivalence is not an equivalence relation on P but in the 
sense of theorem 2.1, 'P- equivalence is "almost" strict equivalence. From 

the proof of theorem 2.1 we obtain the following result. 
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Theorem 2,3 : If Pg is ^-equivalent to P 1 then P 1 and P 0 

are equivalent (in the sense of definition 2.3). 

Proof: Referring to the proof of theorem 2.1 (with 1=1, 2 = p, 3 = v) we 

see that : ftg-^> ft^ and $: ft C ftg satisfy the conditions of definition 2.3- 

With regard to the previously mentioned problem of, given P , constructing 
a problem Pg which is in some sense equivalent to P-^ the procedure which 
we will pursue is clear. Starting with P^ we construct a set ftg and a 
function ^:ftg ->0^ If we define Fgiftg -> R as F 2 = \ ° ^ and lf the 
function ^ is onto, i.e^Sf^ftg) = ^ then P 2 is ^-equivalent to P ± 
and hence P^, Pg are equivalent. Of more importance is the fact that P 2 
is weakly ^-equivalent to P^ and as a consequence the following is true: 
if ft^ = j then ftg - f-, if 2° j then for each a°e 2° there exists at least one 
P° e ft° such that ^(p 0 ) = a°j finally if p° e 2° then ^ f (p°)e 2° i..e., 

^(P°) is an optimal element for P . Thus, if we analyze P 2 we have, to 
within a transformation (namely \I>:ftg ->2^), at the same time analyzed P . 

Since we will refer to this idea repeatedly, we record the previous discussion 
as the FUNDAMENTAL CONSTRUCTION LEMMA which is the following. 

Corollary 2.2 : If there exists a set Yg, a nonempty set ftgCYg and 

a function ’J'-.ftg -> with ^ onto (jL.js., ^ (fig) = ft ) then (Yg, ftg, Fg) 

define a problem Pg which is ^-equivalent to P^ where Fg = F^ ° \l r . 

It is this corollary that will be the basis of the discussion in 
chapters 3 and 4. 
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3. ^-EQUIVALENCE IN A CLASS OF OPTIMAL CONTROL PROBLEMS 


The discussion in this chapter will be based upon the abstract theory 
of ^-equivalence developed in chapter 2. Specifically, we will be concerned 
with corollary 2.2 and its application to the theory of optimal control. 

To avoid lengthy introductory remarks we adopt the convention that, unless 
explicitly stated otherwise, the terminology and notational conventions of 
Pontryagin et al . (l8) will be used throughout. (Although with regard to 
vector notation, we follow a convention opposite to theirs concerning 
subscripts and superscripts.) 


3.1 The Formulation of Problem P^ 

r 

For any positive integer r let R denote euclidean r- space and let 
n and m be fixed positive integers. Suppose that the phase space X is 
all of R n and that an initial point a and nonempty target set S are 
given in X along with an initial time t Q e R. In addition let the control 
region U be a nonempty subset of R m . We assume that S is closed, a j. S, 

and U is compact (jL.ei., closed and bounded). Finally, we assume the real 

Bgi _ 

valued functions g^ and are defined and continuous on X X U for 

<3 

i = 0, 1, 2, ..., n and j = 1, 2, ..., n and we let g = (g-j_, . .., g n ). 

The optimal control problem designated P^ will be formulated in terms of 
the quantities mentioned in this paragraph. In anticipation of this, define 
the following: 

Definition 2.1 : Y 1 = |u = ( 1 ^, ..., u ffl ): u is measurable and 

u:[t Q , t*] -» U for some t* > t^ | 
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We remark that measurable and integrable are meant in the Lebesgue 
sense and that a vector function is measurable (integrable) if each of 
its components is measurable (integrable). Furthermore, since U is 
bounded it follows that if u e Y-^ then u is integrable. 

The following lemma is a standard result in analysis (see, e_.g., ( 19 ) 
page 39 ) > but since it will be used repeatedly we choose to label it lemma 3*1 
to facilitate later reference to its contents. 

Lemma 3 » 1 : If a: A -> R^ is continuous for 0 ^ ACR V and positive 

integers v, p then for each bounded, measurable function |3:[tQ, t^] -» A the 
function a,o(3:[tQ, t^] -* R^ 1 is integrable. 

Returning to the optimal control problem let us define some standard 
terminology . 

Definition 3-2 : If corresponding to a finite t^ > t Q and function 

uift^, tj_] U In Y 1 there exists an absolutely continuous function 
x:[tg, t^] -> X such that x(t) = g(x(t), u(t)) a.e. on [t Q , t^] , x(t^) = a 
and x(t-^)e S then u is an admissible control (function), x is an 
admissible trajectory , and t^ is a terminal time . 

Since it requires x(t^)e S, this definition of an admissible control 
is stronger than the one given by Pontryagin et al. However (as observed by 
many authors), a control which does not ultimately steer the trajectory to 
the tar-get set cannot be an optimal control and thus there seems to be little 
value in considering a control which is not admissible in the sense of 
definition 3-2 for some t^ > t^. With regard to definition 3-2 the next 
lemma says that the admissible trajectory associated with an admissible 
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control is unique and that without loss of generality we may take t-^ as 
the least terminal time. The uniqueness of the trajectory is a standard 
result in the theory of differential equations (see (l8), p. 78) and thus 
we only concern ourselves with a proof that there exists a > t Q such 
that x(t 1 )e S and t €[t Q , t^ implies x(t)/ S. 

Lemma 3 -2 : If u:[tQ, t*] -> U is an admissible control and x:[t Q , t*]->X 

is an associated admissible trajectory then t ± = inf | t > t 0 :x(T)e S | > t Q 
is a terminal time and x restricted to [t Q , t ] is unique. 

Proof : Let T = J t > t 0 :x(T)e S } then t* e T and thus 
t^ = inf T > t Q exists. By definition there exists a sequence (T n ) t^ 
with T n e T, i.e., x(r n )e S and T n — Furthermore, x is continuous 

on [t Q , t*] and in particular is continuous at t^ since t Q 5 t^ 5 t*. 
Therefore (x(r n )) -> x^) and since S is closed it follows that 
x(t^)e S jL.e., t^ is a (least) terminal time. Finally, if t Q = t.^ then 
x(t-^) = x(t^) = a which contradicts the assumption a j. S. Thus t^ < t^. 

Lemma 3*3 : If u is an admissible control with the corresponding 

(unique) admissible trajectory x and (least) terminal time t^ then 


F 1 (u) 



g 0 (x(T), 


u(t)) 


dT 


exists . 

Proof : Since x and u are bounded, measurable on [tQ, tp] and since gQ is 

continuous on X*U, the existence of F^(u) follows from lemma 3.1. 
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In other words , lemma 3-3 serves to define a functional F^ on the 
set of admissible control functions. In agreement with the notation of 
chapter 2 we make the following definition: 

Definition 3-3 : Let Y^ denote the set of admissible control 

functions . We will sometimes write, for brevity, (u; x, tj)e 
if u e x is the associated trajectory, and t-^ is the 

terminal time . 

In passing we note that it may seem more consistent with the notation 
of chapter 2 to define as ja:a = (u, x, t^) } where u is an admissible 

trajectory, x is the associated trajectory, and t^ is the terminal time. 
However, since x and t are unique (for each u) this is unnecessary and 
in fact is undesirable since it would require us to adopt some clumsy conven- 
tions concerning the construction of Y 1 . Therefore we use the definition 
of given by definition 3*3 and yield to the previous discussion by 

occasionally writing (uj x, t^)e 

The usual purpose of optimal control theory is not to determine whether 
or not is empty, but rather, given j f, to find a minimal element 

of Therefore we assume ^ 0 and state the following. 

Definition 3-^ : Problem P^ is: find a u° e such that 

F-^(u°) £ F-^(u) for all u e If an optimal element (u°; x°, t°^) exists 

for P^ we call u° an optimal control (function) and x° the associated 
optimal trajectory . Finally, we denote the set of optimal control functions 
as • 
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3.2 The Construction of Problem Pg 

Consistent with the discussion in chapter 2 we have defined P^ and 
now we make the following basic assumption which we assume is satisfied 
throughout the remainder of this chapter. A later chapter (chapter 6) is 
devoted to the question of the existence and explicit construction of a 
set Z and function \|r satisfying the basic assumption for particular 
choices of the set U. 

Basic assumption : there exists the following 

(1) a positive integer p, 

(2) a nonempty set ZCR^ (perhaps Z = ~Bp) , 

(3) and a continuous function 'if-.Z -* U with \|r( Z) = U, i,..e.> 'if 
is onto. 

It will be this set Z and function ^ which will serve as a basis 
for the construction of a set fig, a function ^ '• fig fi-j_ an< ^ problem Pg 

which is ^-equivalent to 

Definition 3-8 : For (x, z)e XxZ define f Q and f = (fp • ••, f R ) as 

f ± ( X , z) = g i (x^ t(z)) 

where i = 0, 1, 2, ..., n. 

The following lemma is an immediate consequence of the properties of g^ 
and the continuity of t . 

8f,- 

Lemma 3.4 ; The real functions f^ and are defined and continuous 

on XxZ for i = 0, 1, 2, n and j = 1, 2, n. 

The functions f and f = (f ± , f n ) satisfy the same hypotheses 

as g„ and g. Thus in terms of (a, t Q ) , X, S and Z we can obtain the 
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following results and definitions by formally replacing g^ by f and U 
by Z in the previous discussion. It is important to note that we have 
not assumed Z to be bounded and as a consequence some changes must be made. 

Remark 3-1 : Yp = { z = ( z p* •••> z p) : z is bounded , measurable 

and z: [t Q , t # ] -> Z for some t* > t j- . 

Remark 3.2 : An admissible control z:[t Q , t-jJ -»Z, admissible trajectory 

x:[t Q , t^] — > X and terminal time t^ are defined analogous to definition 2.2 
with the understanding that z is bounded with x(t) = f(x(t), z(t)) a.e. on 

With this understanding that z be bounded, the analogue of lemma 3.2 
remains true (see (l8) , p. 78 ) so that corresponding to an admissible control 
z, the associated trajectory x is unique and again we take t^ as the 
least terminal time. Furthermore, we sometimes write (zj x, tp)e fig where: 

Remark 3 • 3 : fi ^ denotes the set of (bounded) admissible control functions. 

As the analogue of lemma 3-3 we have the following lemma which defines 

the functional Fg on fig. 

Lemma 3-3 : If (z> x, t^)e fig then 



exists . 

Proof : From lemma 3.4 f is continuous on XxZ and again since x and 

z are bounded, measurable lemma 3- 5 follows from lemma 3. 1. 

While it may not be clear that fig / 0 we will show later that under the 
assumptions made so far this is indeed true. With this in mind, we make the 
following remark. 
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Remark 3 A : Problem P 2 is: find a z° e fi g such that F 2 (z°) < Fg(z) 
for all z e n . If (z°> x°, t° ) is an optimal element for Pg then z° 
and x° are called an optimal control and optimal trajectory, respectively. 

The set of optimal control functions is designated fi° . 

Following the program of chapter 2 we will show that Pg as constructed 
is ^/-equivalent to P^ for an appropriate function 'I': fig As one 

would expect, the function M/, which maps a function z to a function u is 
determined by applying the function at each point z(t) . This idea is 
explicitly defined (definition 3-7) later but first some preliminary work is 
necessary. 

Remark 3.3 : If (z> x, t^e fig and the bounded function z*:[t Q , -> Z 

satisfies z = z* a.e. then (z*j x, t^)e fig and Fg(z) = Fg( z ). An analogous 

statement is true in fi^. 

Remark 3*5 is an immediate consequence of the definition of an admissible 
trajectory and terminal time. It is merely an expression of the intuitive 
idea that changing a control function on a set of measure zero produces no 
effect on the trajectory and functional value. For this reason one can avoid 
distinguishing between two admissible controls which agree except on a set of 
measure zero . 

The following two theorems express the fact that if the admissible controls 

z (for Pg) and u (for P^) are related by t°z = u then they produce the 

same admissible trajectory, terminal time, and functional value (i.e., 

F (z) = F- l (u)). Thus, in a generalized sense, Pg is P^ with a change of 
variables in the space of admissible controls. We will return to a discussion 
of this statement following theorem 3-3- 
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Theorem 3-1 : If 1 z is an admissible control (for P^) with admissible 

trajectory x and terminal time t^ then u = t°z is an admissible control 
(for P^) . Furthermore, x is the (unique) trajectory and t^ the (least) 
terminal time associated with u and F-^(u) = Fg(z). 

Proof : From lemma 3-1 the function u = tyozift^, t^] -» U is 

measurable. By hypothesis x(t) = f(x(t), z(t)) a.e. on [tQ, t^] . Since 
f(x(t), z ( t ) ) = g(x(t), i|f(z(t)))= g(x(t), u( t ) ) and since x(t Q ) = a, 
x(t^)e S it follows that x:[tQ, t^] -»X is the (unique) admissible 
trajectory for u. Furthermore, t^ is the least terminal time for u - 
for if not then x(t)e S for some t e[t^, t^) which contradicts the fact 
that t^ is the least terminal time for z. Therefore (uj x, t^)e fi . 

Finally, 

F 2 (z) = J 1 f Q (x(T), z(t)) dT 
t 0 
r\ 

= j g 0 (x(T),t(z(T))) dT 

l o 

- f 1 g n (x(T), u(t)) dT 
Jt 0 

= Fq (u). 

Theorem 3-2 : If (u> x, t^)e and if there exists a bounded measurable 

function z: [t^, t^] -» Z such that t°z = u then (z;x, t^)e 0 ,^ and 
F 2 (z) = F 1 (u). 
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Proof: As in theorem 3.1 we obtain x(t) = g(x(t), u(t)) = f(x(t), z(t)) 
a • e • and as before we can argue that (z> x, t^e 0 ,^. Similarly, we obtain 
Fg(z) = F 1 (u) . 

Although it is not needed an even stronger result than theorems 3.1 and 
3-2 follows from the same type of argument, the definition of the least 
terminal time and the uniqueness of the trajectories. 

Corollary 3.1 : If (uj x, t^e and (z> x*, t*)e and if 

u(t) = *(z(t)) on [t Q , T ] for T = min | t] _, t* \ then t x = tj, x = x*, and 
F 1 (u) = F 2 (z) . 

3-3 P 2 13 1 i f - Equivalent to 

As indicated by theorem 3* 2 and pointed out in chapter 1 the key result which 
enables us to prove P g is * -equivalent to P ][ is Filippov' s implicit function 
lemma - a lemma which (for several reasons) is of fundamental importance in the theory of 
optimal control. With this lemma in mind we make the following definition. 

Definition 3- 6 : The system (^, Z, U, R) satisfies the measurable IFP 

(implicit function property) if for each interval [t Q , t^CR and (bounded) 
measurable function u:[t Q , t^ -> U there exists abounded, measurable 
function z:[t Q , t^ -> Z such that u = t°z. 

With regard to the statement made prior to remark 3.4 we have the 
following . 

Corollary 3.2 : If (t, Z, U, R) satisfies the measurable IFP then 

n 2 4 0 • 

Proof: By assumption ^ / ( and thus there exists a (u> x, t-^e ft . 

By hypothesis there exists a bounded measurable function z:[t Q , t^j -> Z 
such that i|f°z = u and from theorem 3.2 (z; x, t-^e ft . 



23 


Prior to remark 3*5 there was some discussion concerning the construc- 
tion of a function 'I'-.Dg -* ft . We formally do this now, noting that 
theorem 3-1 justifies the definition. 

Definition 3-7 : Let ^ftg Le defined as ^(z) = u if u = t°z. 

Corollary 3- 3 - If (t, Z, U, R) satisfies the measurable IFP then 
maps fig onto (_i._e., ^(fig) = ft^), and in addition Fg = F^° ^ . 

Proof : That is onto follows immediately from the arguments of 

corollary 3-2. Furthermore, if z e ftg then from theorem 3-1 
Fg(z) = F 1 (\koz) = F 1 (u) (i.e., Fg = #) . 

Combining corollaries 3-2 and 3*3 and interpreting corollary 2.2 in 
terms of the notation of this chapter produces theorem 3*3- 

Theorem 3- 3 : If (t, Z, U, R) satisfies the measurable IFP then Pg 

is ^-equivalent to P^. In particular, the following statement and its 
converse are true: for each optimal element (u°> x°, t°^)e there exists 

an optimal element (z°> x° t^)e ft°g with i|foz 0 = u°. Furthermore, 
t_ L =t 1 ,x = x and F^(u ) = Fg(z ). 

3.L Filippov's Lemma 

Definition 3*6 can be stated diagrammatically in the following form. 

The system (t, Z, U, R) satisfies the measurable IFP if there exists a bounded, 
measurable function z which completes the diagram 



\|r (onto, continuous) 

Figure 3-1- Commutative diagram. 



24 


If Z is compact and if \|/ is one-to-one then is a homeomorphism 
( i_ ^ 7 exists and is continuous). In this case, given u, we have 
z = t 1 °u which by lemma 3.1 is measurable and bounded and hence (t, Z, U, R) 
satisfies the measurable ITT. It is this case, with the additional assumption 
that f be C^ - , which would normally be considered in a change of variables 
discussion. However, Filippov's lemma shows that these assumptions are far 
more restrictive than necessary. 

Rather than prove Filippov's lemma, completely, we refer the reader to 
the original paper (5) pages 78-7 9 , or to an identical proof on pages 30-31 
of (7) • Since we will make a later comment concerning the construction of 
the function z we discuss this aspect of the lemma. If K C R^ is a 
nonempty compact set, then the continuous function a^:K -» R where 
cv ( z) = z i (i._e., z = (z x , ..., z^)) has a minimum on K j-.e., there exists 

X.T- 

at least one point of K whose i 1 " component is least. Thus by applying 
to K we determine an element z^ whose first component is least. 

If there is more than one such element, we apply a 2 to the compact set 
KOo^ 1 (a-^ z 1 ) ) thereby determining from among the set of elements whose first 
component is least, an element whose second component is least. Again, if 
there is more than one such element we continue inductively for 
i = 1, 2, ..., p and in this manner uniquely determine a point in K. 

Lemma 3-8 (Filippov): Under the conditions of the basic assumption, 

namely that U is compact, and t is continuous with \|r(z) = U and with 
the additional assumption that Z is compact, the system (t, Z, U, R) 


satisfies the measurable IFP. 
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Proof: Let u:[t , t^] -» U be a measurable function and for each 
t e[t Q , t x ] let K(t) = t _1 (u(t)) . Since K(t)CZ and since ^ is 
continuous the set K(t) is compact and nonempty (\|f is onto). Let z(t) 
be the unique element of K(t) determined as in the previous discussion 
and in this manner we define a bounded function z:[t^, t^] -* Z such that 
t°z = u. A proof that z is measurable is based on Luzin's theorem and is 
contained in the previously mentioned references. 

Theorem 3. 3 and lemma 3*6 together state that the basic assumption plus 
the compactness of Z are sufficient for ^-equivalence. Frequently (as in 
chapter 4) it is desirable to remove the compactness assumption on Z and 
one way of doing this is by strengthening the assumptions on Doing this 

one obtains the fundamental result of chapter 3 which is the following. 

Theorem J.k : If the conditions of the basic assumption hold (Z need 

not be compact) and if in addition there exists a compact set Z*C Z such 
that t maps Z onto U then Pg is ^-equivalent to P^. Furthermore, if 
(z; x, t-^)e fig then there exists a measurable function z*:[tQ, t^]-»Z* with 
(z*> x, t^)e fig such that ^oz = ifoz* and F 2 (z) = . Finally, if 

(zj x, t^)e fi° (jL.es, z is an optimal control) then (z*j x, t^)e fi°. 

*1 

Proof : By lemma 3*6 the system ( + | 2 *, Z , U, R) satisfies the measurable 

IFP so that (t, Z, U, R) does also since Z*c Z. From theorem 3*3 it follows 

that Pg is ^-equivalent to P^. 

Now if ( z) x, t^)e fig then (u> x, t^)e fi^ where u = t°z. Since 

■X- 

Z , U, R) satisfies the measurable IFP there exists a (bounded) measurable 
function z*:[tQ, t^] -> Z with t°z* = u. From theorem 3*2 

^~The function tl„* denotes the restriction of ty:Z U to Z^C Z 

| Li 
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w y. 

and Fg(z ) = F-. (u) = Fg(z). That the functions i|/oz and t°z are equal 
follows since both are equal to u. Finally, if (zj x, t^)e ^ then since 
we have (z*) x, t^)e and theorem 3*^ is established. 

Paraphrasing the results of the previous theorem, one finds that, under 
the hypothesis of theorem ^>.b, is ^-equivalent to P-^ but without loss 
of generality one can select the optimal controls for P^ (if any exist) 

from among the set of (bounded) measurable functions whose range is contained 

* 

in Z . 

Before closing this chapter, some observations are appropriate. First, 
if the hypotheses of theorem 3*^- are satisfied with Z = R^ then P^ has 
been transformed into a problem Pg whose control region is the whole space 
R^. As is illustrated in chapter 4, this provides a bridge between the 
calculus of variations and the theory of optimal control i^.e., a direct 
method is available for relating the results of the calculus of variation, 
(e.g., sufficiency theorems, Hamilton-Jacobi theory and, of course, necessary 
conditions) to problems of optimal control. In addition, since many numerical 
algorithms are based on the (implicit or explicit) assumption that the 
control region is open, this provides a method of applying these algorithms 
in the case where U is compact. 

Second, as indicated in (6), there may be some advantage to replacing 
a problem whose control region has corners by one whose control region is 
smooth (in some sense). From theorem 3-3 (or 3*^ with Z = Z ) if we may 
choose Z as, for example, the closed unit ball in R^ this goal is 
accomplished. We will say more about this in chapter 6. 
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Third, Filippov's lemma recently has been the subject of research, see 
for example (12), (l6) . Particularly the paper of McShane and Warfield (l 6) 
generalizes the lemma to include the case where Z is a separable metric 
space, U is a Hausdorff space, and [t Q , t-J becomes a subset of a measure 
space. Thus, in our case, it appears that we could relax the hypotheses 
of theorems 3*3 and 3.4. However, since the original version ( 5 ) will prove 
to be completely adequate for use herein, and since the compactness of Z 
(or Z*) permits an easy visualization of the construction of z:[t Q , t.] -> Z 
we choose to use Filippov's lemma in its original form. In fact, lemma 3 .6 
is actually less general than the original and is an immediate corollary of 
it. We remark that the original was intended for use in an existence theorem 
and that its application herein serves to underscore the fundamental role 
this lemma plays in the theory of optimal control. (See the discussion on 
pages 293-297 of (22)). 

Finally, loosely speaking, all of what was done in this chapter could 
be interpreted in terms of piecewise continuous (as opposed to bounded, 
measurable) controls. However, to do this it would be necessary to obtain 
the piecewise continuous analogue of Filippov's lemma (_i.e., if u is 
piecewise continuous under what additional assumptions (if any) on U, Z, 
and t can we assume that z is piecewise continuous) . 
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4. LAGRANGE PROBLEMS AS OPTIMAL CONTROL PROBLEMS 

On several occasions in chapter 3 it was mentioned that we would be 
interested in transforming a problem P^, with a compact control region, 
into a problem P^ with R^ as its control region. This is the idea which 
will be pursued in this chapter and with this in mind, let us strengthen the 
basic assumption of chapter 3 to read: the set Z is all of R^ and there 

■¥r 

exists a compact subset Z C Z such that maps Z onto U where U 
is compact and connected. We remark that since Z = R^ is connected and 
t(z) = U, then by necessity we must assume U is connected. This set of 
assumptions will be referred to as the strengthened basic assumption . 

Remark 4.1 : If the strengthened basic assumption is satisfied then 

theorem 3-4 applies. 

In chapter 6 we show that for certain types of compact, connected sets 
U it is possible to find a positive integer p, continuous function ty, and 
compact set Z* which satisfy the strengthened basic assumption. These 
results combined with remark 4.1 theorem 3*4) verify that for a rather 

arbitrary control region U we may transform P^ into a problem P^ which 
is ^-equivalent to P, and which has an unconstrained control region. While 
in many instances this may be adequate, if one wants to cast problem P^ into 
the traditional framework of the calculus of variations some additional 
assumptions are necessary. We list these in the following paragraph. 

Throughout the remainder of this chapter assume that the strengthened 
basic assumption is satisfied and that in addition the function t is CT^ 
on Z = R^. Furthermore, suppose there exists an open set V C R m with UCV 
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such that the functions g Q and g = (g-j_, ... g ) are C 1 on XxV. With 
these additional assumptions the following is true. 

Lemma 4.1 : The functions f Q and f = (f , . .., f n ) (definition 5 . 5 ) 

are on X*Z (i_.£., on R n+ ^) . 

For convenience let us restate problem P 2 in the form it will assume 
throughout the remainder of this chapter. 

Problem P 2 : To find a bounded, measurable control function 

z = ( z p.> •••> z °p) with z°(t)e Z = R^, a corresponding (unique) absolutely 
continuous trajectory x° = (x^, . .., x° n ) and a (first) terminal time 
t 0 ^ > t^ such that 


(1) x°(t 0 ) = a 

(2) x°(t° 1 )e S 

(3) x°(t) = f(x° 1 (t), x° n (t), z° 1 (t), . .., z° p (t)) a.e. on [t Q , t^] 


and such that 



0 



minimizes 


r 2 ( z ) 



z( t ) ) dT with respect 


to all function x, z and terminal times t p satisfying these conditions. 

Since X x Z = R 11 ^ and since the functions f , f are on X x Z 

it is well known (see ( 18 ), chapter V) that P^ is "equivalent" to a certain 
Lagrange problem in the calculus of variations. We show herein that this 
Lagrange problem is in fact ^-equivalent to . Since one of the primary 
objectives of this paper is to illustrate how one can transform an optimal 
control problem into a calculus of variation problem - thereby correlating 
known results in the two fields - we will go into some detail in the following 


discussion. 
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For arbitrary vectors y = (y^ .... y n+p ) and y = (y^ .... y n+p ) in R n+P 
define functions 

e i (y ’ y) = f i (y i> V y „+i' v P ) 


for i = 0, 1, 2, . . n and 


y) = y ± - 0 ± (y, y) 


for i = 1, 2 , . . . , n. 


(Later we will introduce a function y and its derivative y which are 
not to be confused with the arbitrary vectors y and y used here.) The 
next remark follows immediately. 

Remark 4.2 ; The functions 0 Q , 0 = (0^ ..., 0^) and cp = (cp , . .., cp n ) 
are on R n+P x R n+P . 

In terms of the functions 0^ and 9 let us define a Lagrange problem 
and then prove that it is 'I'- equivalent to P . Since this is the only 
Lagrange problem considered in this paper, it will be referred to as the 
Lagrange problem or sometimes as problem . Consistent with the notation 

of chapter 2 and the terminology of the calculus of variations let X^ denote 

3 

the set of all absolutely continuous functions y = (y„ , .... v )• 

1 ’ “'n+p' 

[t Q , t*J -» R for some t^ > t Q such that 


y(t) 


w(t) dT + y*" 


0 
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for an arbitrary vector y°e R n+P and bounded, measurable 


function 


w:[t 0 , t # ] -»R' 


n+p 


By convention we agree to call the function w the 


derivative of y and in this way extend the definition of y to all of 

cl * ci 

[t Q , t*] i._e., if y e Yj then ^ y exists a.e. and y = — y a.e. for 

a bounded measurable function w:[t Q , t*] R n+P with w = y. 

Definition 4.1 : If corresponding to t^ > t^ there exists a function 

y:[t Q , t^] -> R n+P in Y^ such that 


a.e., 


and 


0 = q>(y(t), y(t) ) 


y(t Q ) = (a x , ..., a n , 0, 


,, 0 ) 


y(t 1 )e SxR P 

then y is said to be an admissible arc and t^ is a terminal time. 

As in lemma 3*2 we can assume without loss of generality that t^ is 
the least terminal time and, as in lemma 3-5> that the functional 

Fj(y) = J 1 9 Q (y(T), y(r) dT 
t 0 

is defined for each admissible arc. Let fi, c Y, denote the set of admissible 

3 3 

arcs so that F, is a functional on fh and problem P., becomes: find a 

3 3 3 

y°e such that F^(y°) ^ F^(y) for all y e fi . If such a y° exists 
we call it a minimal admissible arc and we let denote the set of all 

such arcs. As noted in chapter 2, P^ is a global problem and since much of 
the calculus of variations is concerned with local (as opposed to global) 


minimal arcs the global nature of P^ should be kept in mind to avoid possible 


confusion. 
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If y e fi^, with terminal time t^ define functions x:[t^, t^] -> R n 

and z:[t Q , tj] R P by x^ = y^ for i = 1, 2, . .., n and z ± = y^ +i 

for i = 1, 2, . .., p. Since cp(y(t), y(t)) = 0 a.e. it follows from the 

definition of cp that x(t) = f(x(t), z(t)) a.e. and that (zj x, t- L )e fig. 

Also from the definition of 0 Q we have F^(y) = Fg(z). Conversely starting 
with (z> x, t^)e fig the function y:[t 0 , t-J -» R n+P defined by 


for i = 1, 2, . .., n and 


y r,+i (t) m l Z i (T) dT 


for i = 1 , 2, . . ., p 

is in fij. Since fig f (f> we have fi / 0 and in addition the following 
lemma is true . 

Lemma 4.2 : The function defined by \I>(y) = z where y^ = x^ for 

i = 1* 2, . .., n and y n+ j_ = for i = 1, 2, . . ., p maps fi^ onto fig and 

3 2 

Therefore, in the terminology of this chapter, corollary 2.2 becomes 
the following. 

Theorem 4.1 : The Lagrange problem (P^) is ^-equivalent' 1 ' to Pg. In 

particular, the following statement and its converse are true: for each 

minimal admissible arc y° and terminal time t^ there exists an optimal 
control z°, optimal trajectory x° and terminal time t*^. Furthermore, 


"In fact ^ is one-to-one and hence P p, P^ are strictly equivalent in the 


sense of definition 2.2. 
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these quantities are related by y°_, = x°^ for i = 1, 2, . . n 
y° , . = z L . for i = 1, 2, .... p and t, = t* . 

Remark 4.2 : From remark 2.2, the Lagrange problem is equivalent 

to where ^ is defined in definition 3*7 and ^ is defined in 

lemma 4.2. 

In chapter 5 we will derive the (Pontryagin) maximum principle for Pg 
using the necessary conditions of the calculus of variations and theorem 4.1. 
From this we obtain the maximum principle for P^ using theorem 3-4. We 
remark that as a consequence of remark 4.2 we could skip the intermediate 
step (Pg) - however, we choose not to do this. 
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5- AN APPLICATION OF EQUIVALENCE 

In this chapter we will apply the necessary conditions of the calculus 
of variations to the Lagrange problem (P^) formulated in chapter 4 and, 
using the fact that P^ is equivalent to derive the (Pontryagin) 

maximum principle for P g . Furthermore, by using the fact that P^ is 
^-equivalent to P^, we ultimately obtain the maximum principle for P . 

This result (i_._e., a proof of the maximum principle for the original optimal 
control problem) should not be interpreted as an implication that the material 
of chapters 2 through 4 is no more than preliminary to the material of 
chapter 5. On the contrary, chapter 5 is included in this paper because it 
contains an immediate application of the results of chapters 2, 3 , 4, and 6 and 
because the material of this chapter serves to indicate the basic foundations 
required for a study of the correspondences between the calculus of variations 
and optimal control theory. 

5.1 Necessary Conditions From the Calculus of Variations 
In chapter V of (l8), Pontryagin et al. consider a Lagrange problem and, 
in effect, by transforming the Lagrange problem into an optimal control 
problem they obtain the Weierstrass condition and multiplier rule from the 
maximum principle. Thus, their discussion is parallel, but opposite in 
direction, to the initial developments of this chapter (namely, to the 
derivation of the maximum principle for P g , starting from P^) . This 
paragraph has been included solely for the convenience of the reader who, if 
he consults (l8), should be careful to observe that (in our notation) the 
Lagrange problem of Pontryagin e_t al. is slightly more general than the 



35 


Lagrange problem considered herein. Specifically, they permit the functions 
e 0 > •••> e n to be dependent not only on y^, . y n but also on 

y n+1 > ’ • • > y n+p - Furthermore, 0 Q may be explicitly dependent on 

y r y n - 

For a long time the standard reference on the classical problem of 
Lagrange has been part II of (l) . A more modern introduction to the (classical) 
theory is contained in (20) , which, incidentally, contains a discussion 
(pages 316-323) similar in content to the discussion of (l8), chapter V. 

Also ( 9 ) contains a great deal of information on the Lagrange problem and 
its relation to the theory of optimal control.^" However, all of these references, 
and in addition, many others (with the notable exception of (22)), discuss the 
Lagrange problem with the assumption that the trajectories are at least 
piecewise smooth, rather than absolutely continuous. Furthermore, in order 
to elevate the Weierstrass condition, multiplier rule and transversality 
conditions to the same level of generality as the maximum principle these 
necessary conditions must be established without any assumptions regarding 
normality. For these two reasons, we turn to the work of E. J. McShane and 
specifically to (15), page 2b, for a statement of the previously mentioned 
necessary conditions. Note, however, that this reference deals with necessary 
conditions for a parametric problem and consequently, since is nonparametric, 

the results must be transformed to results for a nonparametric problem. 


In contrast to the transformation method outlined herein, ( 9 ) uses the 
method of "slack variables" - usually attributed to Valentine (21) - to 
bridge the gap between optimal control theory and the calculus of variations. 
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With these introductory remarks disposed of, let us turn to a discussion 
of P^ and the equivalent Lagrange problem. Since a discussion of the 
structure of the target set S would only serve to confuse the fundamental 
results, we let S be a single point, say b e R n , with a ^ b = (bj_, . .., b n ) . 
Therefore, in the traditional spirit of the calculus of variations the terminal 
condition t p free and y(t 1 )e jb }xr p would be written as y^)) = 0 

where the function a = (cr^, . .., cr^) is defined by cu(t, y) = y^ - b^ for 
i = 1, 2 , ..., n. (i.e., a = 0 defines the closed terminal manifold 
[t_, °°)x jb} xrP in (t, y)- space.) For convenience, we state the Lagrange 
problem considered in this chapter. 

Lagrange Problem : For fixed initial conditions (a, t^e R n+ "*" and terminal 

conditions a = 0 find a (first) time > t Q and function y = (y^, ..., y n+p ) 
in Yj such that"*" 

(1) y(t Q ) = (a 1 , ..., a n , 0, ..., 0) 

(2) c(t 1 , y(t x )) = 0 

(3) c P(y(t), y(t)) = 0 a.e . 
and such that y minimizes 

F 5 (y) = [ 1 9q(y(t), y( T ) ) dT 

t o 

with respect to all other such functions (i-.e., with respect to all admissible 
arcs) . 


L Recall our earlier (chapter 4) convention regarding Y y 
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In chapter 4 we remarked that the functions 9 , 9 and v are C 1 
n+p n+p 

on R X R . The following lemma is an immediate consequence of the 
definitions of these functions and is merely an expression of the fact 
that 9 is independent of y n+1 , . . . , y n+p and y^, . y Q . 

Lemma 3-1 : For all (y, y)s R n+P xR n+P the following is true: 


(1) 


St. S9. Sen 

i i i 

577 = " 577 * 5T 

J J J 


Se. 


s. 

in 7 


iy 5yT 


= 0 


i = 1, 2, 

j = 1, 2, 


n 

n 


St. S0. St. S9. 
<2) 0> ^ 


i = 1, 2, n 

j = n+1 , n+p 


(3) 

with elements 


rank 




n where 



denotes the nx(n+p) matrix 


In anticipation of what will be needed later let us define, for arbitrary 
vectors A = (a q , A n )e R n+1 , S = ! n+p )e R n+P and 

(y, y) € R n+P x R n+P the (Lagrange) function 


h(y; y, A)= - 7 0 9 0 (y, y) 



\ v ± (y> y), 


i=l 


the (Weierstrass) excess function 


n+p 

E(y, y, i, A) = h(y, i, \) - h(y, y, a) (y, y, a), 

i=l 1 


and the (Pontryagin) function 


G(y> 


n 

i=0 


0 1 (y, y) • 
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We note that the functions h and G are on R n+ ^ x R 11 ^ x R n+1 . 

From the calculus of variations we obtain the following necessary 
conditions for y to be a solution to the Lagrange problem. We designate 
it as theorem 5*1 and accept it as being proved. It is from this theorem 
that the maximum principle for both Pg and P^ will be derived. 

Theorem 3-1 : If y is a minimal admissible arc with terminal time t^ 

for the Lagrange problem (jl.es., (yj t^)e , then there exists a nonzero, 
bounded, measurable function A = (a*q, •••> X* n ) on t^] with 

\ q <L 0 a constant, such that for almost all t eft^, t^] : 


(i) (y(t), y(t) , A*(t)) = 


8h 


u 0 


(y(s), y(s), A (s)) ds + c. 


for constants CL where i = 1, 2, ..., n+p, 

n+p 

(2) h(y( t) , y(t) , A*(t)) y^t) (y(t), y(t), A*(t)) 

i=l 1 


8h 

St 


(y(s), y(s), 


7\*(s) ) 


ds + C 


0 


for a constant C^, and 

(3) Efy(t), y(t) , i, A*(t)) > 0 for all vectors | 
satisfying cp(y(t), fj) = 0. 

In addition at the terminal time t = t^ it is necessary that there 
exist constants v^, ..., v n with (Aq, v^, ..., v^) =/ 0 such that 
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I 

j=l 


oa . 
J 

cTt~ 


(9; 


y(t x )) + c Q + 


1 (y(s), y(s), A*(s) ) 


0 


0 


and 

n 

Yj V j 3y7 + c i + f 1 g~ ' (y(s), y(s), A*(s)) ds = 0 

j=i 1 t o 1 


where i = 1, 2, . .., n+p. 


Theorem 5*1 is stated in full generality and, as yet, makes no use of 
the particular form h, E and 0 assume herein. At this time we choose 
to make use of this special form and, in the spirit of optimal control theory, 
interpret theorem 5-1 in terms of the function G defined previously. Before 
doing this, let us, for convenience, state the following lemma which is merely 
an algebraic consequence of lemma 5-1 and the relation 


n 

h(y, y, a) =2^ - G (y> y> *)• 

i=l 


Lemma 5-2 : For all y, ye R n+ ^ and Ae R n+ ^ 


( 1 ) 


5h _ 30 Bh _ ^ n 

= _ 577 > 377 = \ > 377 = 0 


i = 1, 2, 


n 


( 2 ) 

( 3 ) 


3h _ 3 g _ n _ 3h _ 3 g 

377 " 377 ' 377 377 


1 “ 1 
n+p 

. 3h 


1 “ 1 

n+p 


„ . . 3g 

jV i 5y7 " L y i 5yT 

i=l i=n+l 


i = n+1. 


n+p 



and for arbitrary £ e R' 


n+p 


(^) E(y, y, £, A) = G(y, y, 3) - G(y, £, \) 

n+p 

+ ) i (£ ± - y-p) 5I7 (v> i, a) • 

i=n+l 1 

If y is a minimal admissable arc for the Lagrange problem, then there 
exists a constant A*q < 0 and a function A* = (a*q^ A*^, • • • , A* n ) 

•X- 

satisfying the conditions of theorem 5 -l- Since A is bounded, measurable 
and is continuous we have, from lemma 3 .1, that (y(t), y(t), A*( t) ) 

is integrable and hence the functions A^> i = 1, 2, . .., n given by 

A.(t) = - f (y(s), y(s) , A*( s) ) ds + C 

d f u «y -i 

t o 1 

• c^G • x- 

are absolutely continuous and satisfy A^(t) = - (y(t), y(t), A (t)) 

a.e. on [t^, t^] . From (l) of theorem 5-1 and (l) of lemma 5-2 it follows 
that A^ = A*^ a.e. and thus, letting Aq = A*q we have that Aq 5- 0, the 
function A = (Aq> Ap, • • • j> A n ) is nonzero and absolutely continuous, and 
\(t) = - (y(t), y(t) , A(t)) a.e. 

Turning to the terminal conditions of theorem 5-1 we find that, since 
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for i = 1, 2, . .., n and j = n+1, . . . , n+p 

the constants C Q and C n+1 , . .., C n+p are zero. Therefore from (2) of 

lemma 5.2 and (l) of theorem 5.1, plus that fact that A = A* a.e., we 
obtain 

0 = (y(' t )> y(t), A(t)) 

a.e. for i = n+1, . .., n+p. 

From (2) of theorem 5.1 and (3) of lemma 5.2 we have that 

0 = G(y(t), y(t), A*(t)) 

a.e. on [t Q , t.^] and from (4) of lemma 5.2 we have G(y(t), y(t), A*(t))> 

G(y(t), £, A*(t)) a.e. for all vectors £ such that r p(y(t), £) =0. Since 

1 = 1* a.e. these two relations are also true for almost all t e[t Q , t^] 
with A* replaced by A. In other words we have established the following. 

Corollary 5-1 : If (y> t^)e then there exists a nonzero, absolutely 

continuous function A = (a q , ..., A fl ) , with A Q < 0 a constant, such 

that a.e. on [t~, t, ] : 

(l) 1 A^t) = - (y(t) , y(t), A(t)) i = 1, 2, ..., n 

(y(t), y(t), a( t) ) = o 


i = n+1 , . . . , n+p 



(2)' G(y(t), y(t), A(t)) = 0 


(3)' G(y(t), y(t), A(t)) > G(y(t), £, A(t)) 
for all vectors £ satisfying cp(y(t), £) = 0. 

5.2 Necessary Conditions for Pg 

Turning to problem Pg (see chapter 4) let us define, for arbitrary 

vectors (x, z, \)e R n x x R n+ \ the (Pontryagin) hamiltonian 

n 

H(x, z, A) Ajf^x, z). 

i=0 

From the properties of f. we have immediately that H is on 

R n xR P xR n+1 and furthermore that the following lemma is true. 

Lemma 5.3 : If the vectors (y, y)e R n+ ^ x E n+I> and (x, z)e R n xR^ 

satisfy y^ = x^ for i = 1, 2, ..., n and y n+ ^ = for i = 1 , 2, ..., p 

^ | ^ 

then (independent of y , . , . ... y , and y. , .... y ) for A£ R a 
x ■*- J n+r ’ ’'n+p 1’ n 

(1) H(x, z, A) = G(y, y, A) 

(2) ( x , z, a) = (y, y, A) i = 1, 2, ..., n 

i ^i 

(3) (x, z. A) = g-r — - (y, y, A) i = 1, 2, p. 

i °n+i 

From lemma 5 *3; the fact that is ^-equivalent to Pg, and corollary 

5.1, we obtain the maximum principle for Pg . 



Corollary 5-2 (Maximum principle for P p ) : If z:[t Q , t p ] -h> Z = R P 

is an optimal control with corresponding trajectory x and terminal time 
t p ' > *0 then ^ere exists a nonzero, absolutely continuous function 
i = (1 q, Y^, > >n ) with ?\q — 0 a constant, such that a.e. on [t^, t p ] : 

(1) " = - ( x (h)> z(t), A(t)) i = 1, 2, ..., n 

i 

( x (h), z(t) , A(t)) =0 i = 1, 2, ..., p 

i 

(2) " H(x( t) , z(t), A( t) ) = 0 

(3) " H(x(t), z(t), k(t)) >H(x(t), I, 7\(t) ) 
for all vectors £ = (£ p , ..., |^)e Z. 

Proof : If (z; x, t p )e fi°g then from theorem 4.1 there exists (y,t^)e 2° 

with x i (t) = y i (t) for i = 1, 2, ..., n and z^t) = y (t) for 

i = 1, 2, ..., p. Since (y, t^)e there exists a constant "\q "S. 0 and 

functions satisfying corollary 5.1. Thus from lemma 5.3 relations (l)" 

and (2)" follow and furthermore if T = (T , . . . , J ) satisfies cp(y(t), |) 

_L n+p 

then the vector £ = (¥ n+1 > • ••, I n+p )e z satisfies H(x(t), £, x(t)) 

= G(y(t), £, A(t)) which established (3)"* 

We remark that the relation = 0 follows from ( 3 )" and the fact 

- . 1 Z± 

that H is C on Z = R-^. Therefore, in this sense, the multiplier rule - 
when applied to this specific problem (P^) - contains redundant information. 

We turn to problem P p and define, for arbitrary vectors (x, u, \)e R n xVxR n+ ^ 



the hamiltonian H(x, u, a) = / 7\ ± g^x, u) . Again H is C 1 2 3 on 

i=0 

R n xVxR n+ ^ and we have the following lemma (recall X = R n and U C V C R m ) . 

Lemma 3*4 : If the vectors (x, z)e XxZ and (x ; u)e X xU satisfy 

t(z) = u then for A e R n+ ^ 


'l) H(x, u, A) = H(x, z , A) 


( 2 ) 577 ( x > u ’ ^ = 57 “ ( x > z > ^ 

i i 


m 

(3) y ^ r_4 ' x ^ 


3 ^ . 


/ 577 ( x ' Uj 577 ^ ~ 377 ( x ’ z >*)’ i = l, 2, 

j=l J 11 


If (u> x, t^)e then from theorem 3*4 there exists ( z> x, t^)e with 
t°z = u i.e_., ^(z(t)) = u(t) for all t e[t Q , t^] . Applying corollary 5*2 and 
lemma 5. 4 we obtain, as in the proof of corollary 5. 2, the maximum principle for 


5-3 The Pontryagin Maximum Principle 
Corollary 3-3 (Maximum principle for P^) : If (u> x, t^)e then 

there exists a nonzero, absolutely continuous function A = (Aq, A-^, •••, A n ) , 
with Aq 5 0 a constant, such that a.e. on [t^, t^] : 

(1) ' ' 1 i i (t) = - (x(t), u(t), A(t)) 

i 

(2) " ' H(x(t), u(t), A(t)) = 0 

(3) "' H(x(t), u(t), A( t) ) >H(x(t), |, A ( t ) ) 

for all vectors | = ( | )e U. 

1 m 


i = 1, 2, . . . , n 



45 


A few observations should be made concerning corollary 5-3 • First, 
we choose not to include the relation 

f ^tr cn|/ .(z(t)) 

/ (^(t), u(t), A(t)) = 0 a . e . 

j=l J 1 

for i = 1, 2, . .., p since, as indicated previously, it can be obtained 
from condition (3) ' ' ' and the definition of H. Second, we see that 
corollary 5-3 can be considered proved only for those particular control regions 
U for which the existence of a function ^ satisfying the strengthened basic 
assumption of chapter 4 is guaranteed. This existence question is considered 
in the next chapter. Third, the proof of corollary 5-3 (or 5-2) rests on 
the assumption that g (or f) is C 1 on V tU (or Z) . This is a slightly 
stronger assumption than the one made by Pontryagin et al. who only require 
continuity on U (or Z) . Finally, as discussed on pages 101-104 of (l8), 
one can show that the function 

a(t) = ^su Pu H(x(t), I, A(t)) 

is equal to zero for all t eft^, t^] and not just almost everywhere. 

Since a proof of the multiplier role, Weierstrass condition and trans- 
versality conditions, without any assumptions regarding normality and with 
the trajectories absolutely continuous, appears to be approximately as 
difficult as a proof of the maximum principle and since the proof of the 
maximum principle outlined in this chapter depends ultimately on Filippov's 
lemma and the existence of a function t satisfying the strengthened basic 
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assumptions, we do not advocate corollary 5*3 as an "easy" proof of the 
maximum principle for an arbitrary U. Rather, we present corollary 5-3 
as what it really is - one of (hopefully) many straightforward applications 
of the idea of ^-equivalence (or what is the same thing, the idea of a 
change of variables in the control space) . 
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6. THE EXISTENCE OF t 

In this the sixth chapter, we turn to the question of the existence of 
a function f satisfying various forms of what has been called (in chapters 3, 

4, and 5) the basic assumption. To facilitate later reference, let us 
systematically list these various forms, in order of decreasing generality, 
as follows . 

Basic Assumption : Corresponding to a nonempty, compact set U C R m 

(6-1) - there exists a positive integer p, a nonempty set ZC R^ and 
a continuous function ty:Z -» U with ty(z) = U; 

(6-2) - in addition to (6-1) there exists a compact set Z* C Z such 
that t(Z*) = U> 

(6-3) - in addition to (6-2), Z = R^ and (hence) U is connected; 

(6-4) - in addition to (6-3), t is on Z. 

Recall that, from theorem X , if (6-2) is satisfied then Pg is 
^-equivalent to P^ . Furthermore, if (6-3) is satisfied then Pg has an 
unconstrained control region. Finally, if (6-4) is satisfied then Pg (with 
some additional assumptions concerning the function g) is - in the sense of 
chapter 5 - a Lagrange problem. 

Rather than attack the question of the existence of in full generality, 
let us agree hereafter to consider control regions U which are compact and 
convex. It will soon be clear that what we lose in generality we will more 
than gain in practicality. Specifically, when U is compact and convex we 
will obtain not only existence theorems but also explicit representations for t. 
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Moreover, in most cases the form that ^ assumes will be of sufficient 
simplicity to guarantee that the transformation f^(x, z) = g.(x, 4(z)) is 
not only possible but also practical. At the end of this chapter we will 
(briefly) consider a particular type of compact, connected (but not, in 
general, convex) control region) namely we will let U be the continuous 
image of a compact, convex set. 


6.1 U is a Convex Body 

Definition 6.1 : For a positive integer p and real numbers a, b 

with a < b, let [a, b] P = [a, b] x... x [a, b] denote a p- dimensional 
cube while B P (p, z) = {x e R p : || x-z || < p f denotes 1 the p- dimensional 
ball of radius p > 0 centered at z e R p . 

If UCR m is compact and convex and if in addition U contains an 
interior point then dimension U = m (ll) and U is said to be a. convex 
body. If (J is a convex body, we will show that there exists a 
function t satisfying (6-3) . Moreover, when U is a polyhedron 
we will construct a rather simple function t satisfying (6-4). 

Finally, when U is a parallelipiped we will construct a very simple 
function t satisfying (6-4) with p = m. As a first step in 
this direction consider the following lemma which may be found (for example) 
in (3) • 

Lemma 6.1 : If U C R m is a convex body then U is homeomorphic to 
S (l.> 0) . Consequently, any two convex bodies in R m are homeomorphic. 

From lemma 6.1 it follows that if U is a convex body in R m and Z* 
is any other convex body in R then, with p = m and Z = Z*, there exists 


1 


Recall that for 


x e rP, || x || = 


P 

A 

L 

Li=l 


(*i>‘ 


1 

2 . 
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a function satisfying (6-2) i.e., ^:Z U is the homeomorphi sm whose 
existence is guaranteed by lemma 6.1, Therefore, we remark in passing 
that, in the sense of theorem 3-^ ? when U C R m is a convex body, there is 
no loss of generality in assuming that U is B m (l, 0), or [-1, l] m , or 
any other convex body in R m for that matter. Unfortunately, unless the 
homeomorphi sm f:Z -» U is (which in general it is not), we can not use 
lemma 6.1 to construct a function t satisfying (6-4). However, the 
following jLs true. 

Theorem 6.1 : If U C R m is a convex body then there exists a function 

D ■)(- rf Tt 1 ^ / s' \ 

t:R^ -» U, with m = p and Z = ~ 2. * 2 \ w hich satisfies (6-3) i_. e, 

t is continuous and t(R m ) = t(z'*) = U. 

Proof : Since both U and [-1, l] m are convex bodies in R m , from 

lemma 6.1 there exists a homeomorphism ep: [ — 1 , l] m -»U. Consider the function 
7 = (7^, ..., 7 m ) : R m ->R m defined by 

7 ^(z) = sin z^ i = 1 , 2 , ..., m 

where z = (z^, ..., z^) . Clearly 7(R m ) = 7([-7j>7j] IU ) = [-1> l] m and 
since 7 is continuous the theorem follows with t = cp°7- 

Certainly, in the proof of theorem 6.1, if the function r p were 
then 4 would be also and hence t would satisfy (6-4) . Also notice the 
fact that t is one-to-one is not used in the proof. These two observations 
serve to indicate the course we will pursue) namely, we will relax the one-to-one 
requirement on t in order to obtain the condition t € C^ . At the same time 
by assuming that U is a polyhedron we will be able to obtain an explicit 
representation for 
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be 


6.2 U is a Polyhedron 

Definition 6.2 : For ra > 1 and r 1 let w 1 for 

111 

r+1 vectors in R . The set given by 

/ r , X 

| u s R m : u = ^T V- i > 0, ^ Mi = 1 

' i=0 i=0 


i = 0, 1, 


r 


is said to be a polyhedron (or a convex polytope ) . As defined herein, a 
polyhedron is the convex hull of its vertices jw°, . . . , w r j- (3). If 
r = m and if the set { w 1 - w°, . . ., w r - w° }■ is linearly independent 
then the polyhedron is said to be a simplex . (See (10), chapter 5 •) 

Clearly a polyhedron is compact and convex although, in general, it 
may or may not be a convex body. Geometrically a simplex is a compact line 
segment for m = 1, a compact triangular region for m = 2, a compact solid 
tetrahedron for m = 3> and so forth, and is, for all m, a convex body. 

If U C R m is a polyhedron defined by r+1 vertices 
then there exists a Y function <p:R r -+ R m with cp(B r (l,0)) = U. 
hypothesis we have 

( r r , 

U=jueR m :u=^ iuw 1 , [u > 0, ^ ^i = 1 

' i=0 1=0 



Lemma 6.2 : 

(0 r i 

\ w , . . ., w J 

Proof: By 


Notice that if u e U then we have 
r r 

Y i 0 Y i 

u = / u.w- 1 = p„w + / p.w 

U 1 0 A 

i=0 i=l 



i=0 


r 

0 \ i 

w u + j n i w 

i=l 
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so that 



i=l 


Therefore , U may be written equivalently as 


_m 0 

u € R : u = w + 


H i (w 1 - w°), ii ± > 0, 2^ \i ± < 1 


i=l 


i=l 


r m 

Consider cp:R -> R defined by 


cp(z) = w° (zq) 2 (w 1 - w°) 

i=l 


where z = ( z^, . .., z r ) • Certainly cp is C^ and cp(B x (l, 0))cU. 


Moreover, if u e U then u has a representation as w u + ^ P^w 1 - w< ^) 

i=l 

r 

with p^ > 0 and ^ p^ < 1. Defining z = (z^, z^) as z^ = -J"p7 

i=l 


for i = 1, 2, v then z e B r (l, 0) and cp(z) = u ,i..e., <p(B r (l, 0)) 3U 

Notice that <p is definitely not one-to-one. In fact, even when U 

1 2 12 2 2 

is a simplex, we can only conclude that cp(z ) = cp(z ) implies (z = (z 
This follows from the linear independence of the set { w" 1 " - w°, . . . , w r - w° | 
However, of course, when U is a simplex then cp is one-to-one on (for 
example) the set B m (l, 0) O | z e R m ; z^ ^ 0, i = 1, 2, ...,m } . 

Theorem 6.2 : If U C R m is a polyhedron defined by the r+1 vertices 
| w^, ..., w 1 | then there exists a C^ function t:R r -> U (_i ._e . , p = r) 
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with Z* = B r (l, 0) such that t(R r ) = t(Z*) = U _i.e., t satisfied (6-4). 
Such a function is given by 


Q sin 2 £ !lz| 
¥(z) = w U + -2— 


l|z|| 


( \ 2 ( 1 
(z ± ) (w - W ) 


^ 0 


i=l 


t(O) = w 


z = 0 


where z = (z^, . . . , z^) . 

Proof : Let cp denote the function given by lemma 6.2 and consider 

7 :R r -» R r defined by 

sin ^ ||z|| 

7(z) = z z / 0 

|lz|| 

7(0) =0 z = 0 

for z = (z^, z r ) . Since II 7( z ) •! = sin ^ ||zll <1 it follows that 

7 (R r ) C B r (l, 0) . Now, if u € B r (l, 0) with ||u|| / 0 define z = ( z z^) 
as .. 

2 sin H u ll 

z = u 

* iiuii 


where 

. -l 
sin 

is 

the 

inverse of sin: 

r_ £ 

L 2 > 

INI 

2 . - 
= — sin 

-1 

iiuii 

we have ||z|| £ 

1 and 




7(z) 

. it / 2 . -1 

Sln 2 \ k Sin 

iiuii j 




2 -1 

ii 


] -» [- 1, 1] . Since 


— sm 

it 



u 


u 


nun 
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If || u |! = 0 then u = 0 and 7 ( 0 ) =0 by definition. Therefore, 

7(B r (l, 0)) hB r (l, 0) and since 7(R r ) b7(B r (l, 0)) this proves that 
7(R r ) = 7(B r (l, 0)) = B r (l, 0). 

1 T 

To prove that 7 is C on R it is sufficient to prove that 

sin | || z. || sin | vs" 

is C . However, the function a(s) = (s f 0), 

llzll Vs 

3 

a(o) = ^ , a'(0) = j is c 1 on R and P is C"* - on R r where 

p(z) = llzll = + . . . + z^ 

sin | ||z|| r 

which implies <xop(z) = is C on R . Therefore, the 

llzll 

function t = 9 0 7 is on R r and satisfies t(R r ) = t(B r (l, 0)) = U 


which proves the theorem. 

Remark 6.1 : If UClf is a simplex then theorem 6.2 is true with 

p = r = m i.e., ^:R m -> U and ^(R m ) = U. 

Remark 6.2 : The proof of theorem 6.2 is certainly true for any 

function 7:R r -» R r which satisfies 7(R r ) = 7(Z*) = B r (l, 0) where 
* r sin | ||z|| 

Z C R is compact. Whereas theorem 6.2 uses the function 7(z) = z 

llzll 

we could have chosen, for example, the following function which is suggested 
by a consideration of r-dimensional spherical coordinates (( 13), page ^- 27 ). 

7*(z) = sin z^ cos z g 
7*(z) = sin^ z^ sin Zg cos z 


* / ; . 2 

7 . (z) = sin z_, sin z_ ... sin z , cos z 
r-l' 1 2 n-1 n 

-x- / \ 2 

7 (z) = sin z n sin z„ ... sin z , sin z 
1 2 n-1 n 
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with Z* = [- ~ , %] x [■ j x, t t] . Certainly 7* = ( 7 *, . 7*) is C" 1 " 

and it can be established with an inductive argument that 

7*(R r ) = 7*( z*) = B r (l, 0). Hence i|r* = cp o 7 * also satisfies (6-4). 


6.3 U is a Parallelipiped 

Definition 6.5 : For m > 1 let w^ for i = 1, 2, . . ., m be m 

linearly independent vectors in R m and let w® e R m be arbitrary. The set 

m 

| u € R m :u = w^ + N (i^w^, 0 < pu < 1 j- is said to be a (m- dimensional) 
parallelipiped . 

It follows from the definition that a parallelipiped is compact and 
convex. Furthermore, a parallelipiped is the convex hull of its 2 m vertices 
and hence is a polyhedron. Certainly then, theorem 6.2 applies to the case 
where U CR m is a parallelipiped and thus there exists a function 
ih rP ■■■ -—- ^ *-U satisfying (6-4) with p = 2 m - 1. However, we will show 
(theorem 6 . 3 ) that when U is a parallelipiped there exists a function 
^.p>P — tj satisfying (6-4) with p = m and that t assumes a much 
"simpler" form than the form of the function given by theorem 6 . 2 . 

Lemma 6.5 : If U c R m is a parallelopiped then there exists a C^ 
function cp:R m -» R m such that <p( [ -1, l] m ) = U. In addition t is one-to-one. 

Proof : Denote the m+1 vectors which define U by w®, w , ..., w m . 

Define cp:R m R m as m 

<P(z) = w ° + | ^ ( z i + 
i=l 


where z = ( z.. , 
z. + 1 1 

0 < _L_ < 1 


. .., z ) . Certainly cp is C^ 
which implies cp([-l, l] m )cu. 


and if z e[-l, l] m then 
Furthermore, if u e U then 


2 
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u 


w 


m 


u. .w 


where G < (i . 5 1. Define z = ( i 


z ) by 
m ° 


i=l 


z^ = 2\j.. - 1 then cp(z) = u which establishes cp([-l, l] m ) D U. 

The fact that <p is one-to-one follows from the linear independence 
of the set jw 1 , . .., w m } . For, if cp(z' 1 ") = cp(z 2 ) then 

m m 

(z\ + 1) w 1 { Z \ + l)w X 

i=l i=l 


which becomes 


m 



i=l 


5-) wl 


= 0 


1 2 

and thus z = z ^ for i = 1, 2 , . .., m. 

Theorem 6.5 : If U c R m is a parallel ipiped defined by the m+1 vectors 
w°, w 1 , w m in R m then there exists a C 1 function i|f:R m R m 

(i.e., t satisfies (5- 1 0). Such a function is given by 

m 

t(z) = w° + i ^ (sin z 1 + l) w 1 
i=l 


where z = (z, . .., z m ) * 

Proof : Let cp denote the function given by lemma 6.3 and, as in the 

proof of theorem 6.1, define 7:R m -» R m by 


7^(z) = sin z^ i = 1, 2, . . ., m. 

Certainly t = T°7 is C 1 and since 7(R m ) = 7(Z*) = [-1, l] m the theorem 


follows . 
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For m > 1 and real number a., b. with a. < b n - for i = 1, 2, m 

we will designate the set [a^, b^] x. . ,x [a^, b^] as an (m-dimensional) 
right parallelipiped . Frequently, in the literature on optimal control 
theory (especially in the literature dealing with linear problems) it is a 
control region of this type which is considered. This is due both to the 
mathematical simplicity of such a set and to its physical relevance, i .e . , a 
right parallelipiped corresponds physically to a control system in which the 
m controllers are free to move independently (of each other) within a range 
determined by the upper and lower bounds a^, b^. 

From definition 6.3 it is easy to see that a right parallelipiped is, 
in fact, a parallelipiped. Specifically, it is the parallelipiped defined 
by the vectors 

w - (a^, • • • ) a m ) 
w = (b-^ - * • o 6, 0) 

w = (0, 0, . . . , 0, b^ - a^) . 

Therefore in the particular case of a right parallelipiped the function 
given by lemma 6.3 reduces to 

/ b. - a. \ / b. + a. \ 

9j_( z ) ~ \ 2 / z i + \ 2 / i = l,2, ...,m 


and as a corollary to theorem 6.3 we obtain the following. 
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Corollary 6.1 : If U = [a^ b ] x...x [a , b ] then the 

function t:R m -> R m given by 

/ b . - a. \ /b.+a. \ 

t.(z) = [ 1 g 1 ) sin z. + ( ---- - g 1 ) i = 1, 2, .... m 

satisfies (6-4) with Z* = ^ ^ | ] m and p = m. 

As mentioned previously, frequently in optimal control theory one 
assumes that the control region is a right parallelipiped. Using the 
function cp defined by lemma 6.3 and letting t = cpj^ where 
Z = Z = [-1, 1] we find that satisfies (6-2) with p = m. This 
result, in conjunction with theorem 3*4, justifies the frequently made 
remark that when U C R m is a right parallelipiped one can assume without 
loss of generality that U = [-1, l] m . In fact, as lemma 6.3 shows, this 
remark is true in the more general case where U is any parallelipiped. 
Furthermore, when P^ is linear in the controls i_._e., when 

m 

g 1 (x, u) = a j ,(x) + ^ b_(x)u. i = 1, 2, . .., n 
i=l 

then f ± (x, z) = g^(x, t(z)) is linear in the controls z. That is, 

the function cp given by lemma 6.3 transforms a linear problem P 

with U a parallelipiped into a linear problem P^ with Z = Z = [-1, 1] . 

We remark in passing that there is certainly nothing unique about the 
function t in theorem 6 . 3 . For example, the function defined by 
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satisfies theorem 6. 3 with Z* = [-1, l] m . In practice, the form of 
Pj may help to determine which t function should be used. 

In (17), there is a discussion of a derivation of the Bang-Bang 
principle for a linear, time- optimal system with U = [-1, l] m . This 
discussion is in the spirit of chapter 5 and uses the function given 
by corollary 6.1 and the (implicit) knowledge that the vertices of U 
correspond to the points z e Z where z± = ± 1 for i = 1, 2, ..., m. 
This indicates that, at least in principle, the function given by 
theorem 6.2 could be used to derive the Bang-Bang principle for a 
linear time-optimal system with U a polyhedron. For this (and other) 
reasons it is of interest to determine those points of Z* = B r (l, 0) 
which correspond to the vertices of U i^.e., to determine those points 
z e Z* such that t(z) = w 1 for some i = 0, 1, 2, ..., r. Inspection 
of the function t given by theorem 6.2 reveals that t(0) = w° and 
that for z = (l, 0, ..., 0), ..., z = (0, 0, ..., 0, l) we have 

^( z ) = w 1 , i = 1, 2, ..., r. In fact, t(z) = w® for all z e Z with 

l|z|| = 0, 2, 4, ... and t(z 1 ) = w^ where i = 1, 2, . . ., r and 

z = az for a = ±1, ±5, ±5, • • • • Thus, ^ maps B (l, 0) and 

successive shells of thickness 2 onto U. 

6 . 4 - U is the Continuous Image of a Convex Body 

In the paragraph preceding definition 6.1 we mentioned the case 
where U is the continuous image of a compact convex set. The following 
result is a consequence of theorems 6.1, 6.2, and 6.3 and 
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summarizes conditions sufficient to guarantee that there exist an 
onto function -> U which satisfies ( 6 - 3 ) or ( 6 - 4 ). 

Theorem 6.4 : Let V C R^ be a convex body and let cp-.V -> R m 
be continuous. Letting U = cp(v), there exists an onto function 
t:R^ -» U satisfying (6-3). If V is a polyhedron with r+1 vertices 
and if cp is on an open set containing V then there exists an 

onto function t:R r -» U satisfying (6-4). Finally, if V is a 
parallelipiped and if cp is C^" on an open set containing V then 
there exists an onto function t:R^ -» U satisfying (6-4). 

Proof : For each of the three sets VC R^ considered there exists 

an onto function :R^ ->V with P = q., P = r and p = q_ satisfying 
(6-3) , (6-4), and (6-4) respectively, for an appropriate set Z*C R^, 
The theorem follows directly using ^ = cp oi|r*. 
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7. SUMMARY AND CONCLUSIONS 

Based upon several natural equivalence relations defined on a 
large class of optimization problems an equivalence concept, called 
'I'- equivalence, was developed. In terms of this concept the fundamental 
question became: given an optimization problem, say P^, under what 

assumptions is it possible to construct a second optimization problem, 
say Pg, which is ^-equivalent (hence equivalent) to P^. In very 
general terms a complete answer to this question was provided by 
corollary 2.2. 

If an optimal control problem (P-^) has U c R 111 as its (compact) 
control region and if a second optimal control problem (Pg) has ZCR^ 
as its control region, where Pg is obtained by transforming P^ using 
the onto function t:Z -» U, then we may apply corollary 2.2. Thus, we 
obtained that, if t maps the set of admissible controls for Pg 
(point- by-point) onto the set of admissible controls for P , then Pg 
is ^-equivalent to P^. From theorems 3-1 and 3*2 it was possible to 
rephrase this condition as theorem 3*3’» namely, if for each bounded 
measurable function u:[t Q , t-^] -» U there exists a bounded measurable 
function z:[t Q , t^] -> Z such that u= t° z then Pg is ^-equivalent 
to P^. Filippov's lemma guarantees the existence of such a function 
z if there exists a compact set Z*C Z such that t(Z) = ^(Z*) = U 
and thus this lemma reduced the question of ^-equivalence to a 
question of the existence of a function ^ satisfying various forms of 
what was called the basic assumption. These various forms were 
conveniently summarized at the beginning of Chapter 6. 
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In chapter 6 we showed that if U is a convex body in R m 
then an appropriate exists and in fact is equivalent to an 

optimal control problem P^ in which the control functions are 
unconstrained (i.e., Z = R P ) . Furthermore, when U is a polyhedron 
we may choose ^ to be C^" and thus P^ becomes equivalent to a 
Lagrange problem. As an application of this result we were able to 
derive the Pontryagin maximum principle using only the standard necessary 
conditions of the calculus of variations. 

While these results constitute an answer to the fundamental question 
posed earlier, at least in the setting of optimal control and calculus of 
variation problems with absolutely continuous trajectories, they suggest 
several further areas of research, a few of which will be mentioned 
briefly. 

If, instead of bounded measurable control functions, one considers 
controls which are (for example) piecewise continuous then it appears that 
all of what has been done herein could be redone in this case provided that 
a 'piecewise continuous analogue" of Filippov's lemma is available. This 
would certainly be of interest since the vast majority of the traditional 
calculus of variations research and much of the optimal control research 
has been concerned with the piecewise continuous case. 

As indicated by the proof of the maximum principle (chapter 5) there 
is reason to believe that the method discussed herein will provide a very 
effective means of relating other known results in the two areas. This 
should at least be of interest to those people with backgrounds in the 
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calculus of variations who (by desire or by necessity) are doing research in 
optimal control theory. 

It is a fact that today most of the existing algorithms for solving 
optimal control problems are based on the explicit or implicit assumption 
that the control region is open. Therefore, by transforming P^ with U 
compact, into P h with Z = R^, we may immediately apply the algorithm to 
Pg, rather than attempting to modify the algorithms to treat P^ directly. 
Since it is also a fact that virtually all optimal control problems must 
be solved numerically, this last research area may prove to be of immediate 
importance . 
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9 • APPENDIX 

In chapter 2 we were concerned with global minimization and consequently 
with the concept of global weak ^-equivalence. Throughout this appendix we 
will be concerned with local minimization and specifically we will attempt 
to obtain conditions sufficient to guarantee that P g is locally weakly 
^-equivalent to P-^ where Pg is defined as in the sentence prior to 
lemma 2.1. As might be expected, these conditions are obtained only by 
greatly strengthening the properties of ^ :fig -»fi^. 

Using the notation and terminology of chapter 2, let us, in addition 
to the assumptions of chapter 2, assume that Y^ and Yg are topological 
spaces and that T^ and Tg are the relative topologies for fi-^C Y^ and 
figC Yg. Furthermore, assume that ^ifi,., -»fi^ is continuous . Let us agree 
to the terminology that a neighborhood of a e of p e fig) 

is an open set N^e T^ (N^ e Tg) containing a (p). 

Definition 9.1 : If for a°e (p° e fig) there exists N^o e T q (N^o e Tg) 

such that F^(a°) S. F^(a) for all a e N^o (Fg(p°) S Fg(p) for all p e N^o) 
then a° (p°)is said to be a local optimal element . 

We let fi°°C fi^ and fi°°C ^2 ^ eno ^ e 6he set of all local optimal 
elements and we observe that fi° C fi°°, fig C fi°? With a slight abuse of 
notation we will refer to the problems of finding local optimal elements for 
fi^ and fig as and Pg. 

Lemma 9»f : If a e fi^ and p e fig and ^(p) = a then a e fi°° 


implies p e fig . 
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Proof: By hypothesis there exists e in which a is locally 

optimal. Since is continuous there exists e such that ^(Np)c 
Consider p* e N^ and let a* =^(p*)e N^. Since F-^a*) 2: F^a) we have 
F 2 (P*) ^ Fg(p) and thus p e 2°°. 

Lemma 9*2 : ^(D 2 )bf2°° iff 

Proof : The lemma is trivally true if f2°° = 0. Therefore assume 

fi°° 0 0 and consider a° e Q °° . If ^ (& 2 ) bfi°° then by hypothesis there 
exists p° e such that ^(p 0 ) = a° and from lemma 9*1* P° e 2°° i *£ ’> 
^(2°°) b U?° . The converse is certainly true since ^ 2 ^ ®2° ’ 

Lemmas 9*1 and 9*2 and their proofs are vertual repetitions of lemmas 2.1 
and 2.2 and are true because of the continuity of ^ . The analogue of 
lemma 2.3, however, requires more of ^ than just continuity. With this in 
mind we define the following terminology. 

Definition 9*2 : The continuous function 'I' :2g -» 2^ is locally open 
( relative to 2 2 °) if for each p° e 2°° there exists a N^o e T2, in which 
P° is optimal, which satisfies ^ (N^oje T^. 

Observe that if ^ is locally open then it is only necessary for some 
neighborhood of p° € 2°°, in which p° is optimal, that its image be open 
in T^. Certainly if \f r is open then ^ is locally open. 

Lemma 9. 3 : \l>(fl° 0 )C 2°° iff SI> is locally open. 

Proof : If 2°° = 0 the lemma is vacuously true. Consider p° € 2°° 

and suppose ( 2°° ) C 2°° . By hypothesis a° = ^(p°)e 2°° and thus there 

exists N o e T, in which a° is optimal. Since ^ is continuous, 
a 1 

^ _1 (N o) e T 0 and p° e ^ -1 (w °) . Letting N R o = ^ _1 (W o) we have 

CC d (X P CL 

\E(jL'o) = N o which is open in Y n . Consider p e N„o then a =^(p)e N o 

'Pa 1 p a 
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so that F,(a) > F^(a°) which implies Fg(f3) > Fg(3°). Thus 3° is optimal 
in NpO which implies 'I / is locally open. 

Conversely suppose ^ is locally open and consider 3 q e fig 0 . By 

hypothesis there exists N^o e Tg such that \&(NpO)e and such that 

3° is optimal in N^o . Let a° = ^(3°) and define N^o = 'i'(NpO). 

Consider a € N o then there exists 3 € N n o such that ^(B) = a and 

a 3 

since Fg(p) > Fg ( 3° ) we have F^(a) > Fg(a°) . Thus a° e fi°° and hence 
- 1 ,/ n OO \ ,^00 

^2. * 

As the analogue of definition 2.2 we have the following. 

Definition 9-3 : Pg i- s sa ^ to t* e locally weakly jfr - equivalent to P^ 

iff *(fi°°) = fi°°. 

Theorem 9-1 : Pg is l° ca Hy weakly ^-equivalent to iff ^ is locally 

open and ^(fig) D fi°° . 

Proof : If (fi 00 ) = then from lemma 9-3 ^ is locally open and from 

lemma 9-2, ^(fig) ^fi 00 . Conversely if \E r is locally open then from lemma 9-3 
^(fi°°) while if D fi°° then lemma 9*2 implies 'I r (fi 00 ) bfi°°. 

While theorem 9>1 gives a necessary and sufficient characterization of 
local weak equivalence , it assumes a priori a knowledge of fig°. Thus 
the following corollary is a far more practical result. 

Corollary 9 -l : If 'J/ is (continuous and) open with 'I' (fig) = ^ then 

Pg is locally weakly 'I'- equivalent to P^. 

Proof : If ^ is open then it is locally open and ^(fig) = fi^ implies 

'Jf (fig) ^ fi^° thus theorem 9*1 applies. 

Just as corollary 2.2 served as the basis for the study of global 
^-equivalence in optimal control theory, corollary 9-1 would play the same 
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role in a study of local -equivalence . Certainly, however, the construc- 
tion of a function \E r mapping the space of admissible control function 
into the space of admissible control functions which is at the same 

time onto , continuous and open appears to be, at best, a difficult job. 

In closing let us apply the results of chapter 2 and this appendix to 
the problem of minimizing a function of n-variables. Specific all y, let 
Y^ = R n and Yg = R^. From corollary 2.2 we obtain the following. 

Corollary 9*2 : Corresponding to 0 ^ R n and F.:iL -» R suppose 

there exists a positive integer p, a nonempty set fig C R p and a function 
-*ft 1 with = fl 1 . In this case where 

denote the set of global minimal elements of F., :ft, -» R and F_ = F, ° >&: -*R. 

-1- -L _L 2 

Notice that ^ need not be continuous. An obvious application of 
corollary 9.2 is to construct = R p and thus to replace the problem of 
minimizing F^(x) subject to the constraint x e by the problem of 
minimizing Fg subject to no constraints. If one is concerned with local 
minima then corollary 9*1 applies and one obtains the following. 

Corollary 9*3 : In addition to the hypotheses of corollary 9*2 if ^ 

is open and continuous then ^(^2°°) = fi°° where fi°°, denote the 

set of local minimal elements. 



